Best Polyphone Recognition AI Tools & Models - Premium Polyphone Recognition News

AI News

Volc Engine Launches Doubao Speech Recognition Model 2.0 to Improve Multilingual Recognition Accuracy

Volc Engine launches Doubao Speech Recognition Model 2.0, significantly enhancing inference capabilities and supporting multilingual and visual information recognition. The model is based on a 2 billion parameter audio encoder, optimized for complex scenarios, improving the accuracy of recognizing proper nouns, names, places, and polyphones.

18.6k 28 minutes ago

Volc Engine Launches Doubao Speech Recognition Model 2.0 to Improve Multilingual Recognition Accuracy

Models

Claude 3 Sonnet

Anthropic

$21

Input tokens/M

$105

Output tokens/M

200

Context Length

qwen3-vl-plus

Alibaba

Input tokens/M

$10

Output tokens/M

256

Context Length

qwen3-livetranslate-flaltimeash-re-2025-09-22

Alibaba

Input tokens/M

$240

Output tokens/M

Context Length

qwen3-omni-30b-a3b-captioner

Alibaba

$15.8

Input tokens/M

$12.7

Output tokens/M

Context Length

Doubao - Seedream - 4.0

Bytedance

Input tokens/M

Output tokens/M

Context Length

Doubao - Seedream - 3.0 - t2i

Bytedance

Input tokens/M

Output tokens/M

Context Length

Doubao-SeedEdit-3.0-i2i

Bytedance

Input tokens/M

Output tokens/M

Context Length

qwen3-asr-flash

Alibaba

Input tokens/M

Output tokens/M

Context Length

qwen-vl-plus

Alibaba

$0.8

Input tokens/M

Output tokens/M

128

Context Length

Qwen3-0.6B

Alibaba

$0.3

Input tokens/M

Output tokens/M

Context Length

Hunyuan-T1-Vision

Tencent

Input tokens/M

Output tokens/M

Context Length

QianfanHuijin-Reason-8B

Baidu

Input tokens/M

Output tokens/M

Context Length

QianfanHuijin-8B

Baidu

Input tokens/M

Output tokens/M

Context Length

Qianfan-QI-VL

Baidu

Input tokens/M

Output tokens/M

Context Length

Doubao-1.5-vision-pro-32k

Bytedance

Input tokens/M

Output tokens/M

Context Length

Doubao-1.5-vision-lite

Bytedance

$1.5

Input tokens/M

$4.5

Output tokens/M

128

Context Length

Pangu-AgentExpert-N1-0.0.2

Huawei

Input tokens/M

Output tokens/M

Context Length

GPT-4.5

Openai

$525

Input tokens/M

$1050

Output tokens/M

128

Context Length

Qwen_v2.5_0.5b_base

Alibaba

Input tokens/M

Output tokens/M

128

Context Length

Qwen_v2.5_0.5b_Instruct

Alibaba

Input tokens/M

Output tokens/M

128

Context Length

Empowering the future, your artificial intelligence solution think tank

English 简体中文繁體中文にほんご

FirendLinks:

AI Newsletters AI Tools MCP Servers AI News AIBase LLM Leaderboard AI Ranking

Business Cooperation Site Map